Model Selection

Kinetics-400 Pretraining

# Kinetics-400 Pretraining

Timesformer Large Finetuned K400

TimeSformer is a video classification model based on spatio-temporal attention mechanism, specifically designed for video understanding tasks.

Video Processing

Timesformer Base Finetuned K400

TimeSformer is a video classification model based on spatio-temporal attention mechanism, specifically fine-tuned for the Kinetics-400 dataset.

Video Processing

Videomae Base Short

VideoMAE is a video self-supervised pretraining model based on Masked Autoencoder (MAE), which learns internal video representations through masked patch prediction, suitable for downstream tasks like video classification.

Video Processing

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase